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IBSTBACT . • 

The National Longitudinal Study of the High School 
Class of 4972 (HLS) is a large--$cale longitudinal survey effort. The 
current data tapes coptain base-year (1972^ survey data integrated 
with first foil oif- up (1973) survey^ data. The ^ase-year data consist 
of responses to a Student^^^esttonnairer tl^e results of. an ability, 
test, .and inforaation froi the Student's School Record. Information 
Fofp. Follow-up dat-a consist of responses to a tailed >g>iestionftair^.^ 
The pre-itchine data processing consisted of coding alphabetical ^ 
information and "errors", such as partial responses or legitimate ' 
non-responses. Subsequent machine editing included range checks, 
consistency checKs, and routing checks. There are 22, 398. records in 
the HLS lerged base-year/first follon-up .data file—one record for 
each respondent. Each record consists of 921 variiables. The dajta 
contained in ^ach record in .the file are generally grouped ^y type 
and ordered as follows: identification' cod^s, data indicatolrs, tesi 
battery data, student* s School Record Information Fori data, " 
base-yiar gtudelit Questionnaire data, Fl^st • FollowrOp Questionnaire 
data,, quality indices sampling wetights^ sch*ool variable data , and 
analytic indices. These tapes are now available for t)urchase froa. the 
Rational Center for Education. Statistics. (BW) ' / 

, ■ ) ■ ■ . 
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• . . ' ABSTRAiT • ^ , 

•The National \ongitudinal Study of -the High School Class of 1972 is 
a large-scale longitudinal survey effort- NLS dSta have been gather€d^ 
and combined from several sources, coded and edited for analysis, purposes 
and stored in'magnetic computer tapes. The current' tape package, contains 
base-year (1972) purvey dat^ integrated with first follow-up ^ata (1973)- 
This paper provides a gen^i^al description df the contents of the tape 
package which will soon be available for u|e By the general ,r^earch com- 
munity. ' ^ • 



_ I. INTRODUCTION 

The NatioTial Longittidinal Study (NLS) of the High School, CJass of 
197? is a large-scale longritudinal survey ef for^t primarily supported by 
the National Center -for Education Statistics (NCES), Office of the 
Assistant Secretary for Education in the Department j3f Health, Education, . 
and Welf Jte JDHEW) • ' . " * / 

The prinjary purpose of the NLS is' to discover what happens to young 
people after they leave high school, as measured by their Subs^uent 
educational and vocational activities, plans, aspirations, and attitudes, 
and ta relate this inf6niiation to their prioj educational experiences 



and personal and biographical characteristics- Ultiiaate]^, the study 
will allow a better understanding of the development of students as they 
pass through the Americarueducational system, and of the complexity of 
factors {associated with Individual educational and career outcomes. , 

♦ • l^S data have been gathered and combined from sevei;al s^jjdrces, coded^ 
and edited for analysis purposes,- and stored on magnetic computer tapes 
for future access. The 'current tapes contain base-year (1972) survey 
data, collected by the Educational Testing Service, integrated with \ . 
, first fallow-up (1973) survey data; collected by^ the Research Triangle 



Institute. "^5Jiis. tape package wij.1 be augmentad periodically as data » 
from subsequent follow-up' surveyS become avail^le*" 

The major purpose of this paper is to. describe the content-s of the * 
available NLS tapes and-, by so doi-ng, stimulate the interest of the 
educational research community, , ' ' 



* II. THE SAMPLE - ^ 

• , ' * r ^ . ^ 

"The merged -NLS data* file conjt^'ins data^ for a total of 22,398 sample 
members who responded to the base-year Student Ques.tiomiaire And/or the^' 
Firsts Follow-tJp Questionnaire, Comprising this taljal group are 15,635 
individu'als who responded ^to both survey questionnaires, 1,048 who responded- 
to the base-year questionnaire only^ and 5^715 who responde^to the followr 
up questionna^ire only (al^tehoUgh some base-y^ear information was collected 
retrospectively for 4,539 of these individual^, . > ' . • 
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,Thes*e respondents .were sampled as 1972 twelf th graders from .a 
. national probability sample of 1,318 public^ private, and church- 
affiliated high schools located throughout .the 50 States and the 
District ^f ^Columbia, The, scho'bl, sampling f rathe .wafe stratified into — • 
600 final strata, .based on the following seven variaoles: /, >, 

Type of control (pub^llc or nonpublic), '5* , 

- Geogi^aphic division (Nottheast,, North Central, South, West), ^ ' . 

- Grade 12 enrollmeat (fewer tban 300^ 300-599, 600 or more), . 
. - Proxifeity to institutions of higher, education, ^ ^ 

/ - percent minority group enypllment, \ ^ • % 

Income level of the cdmiaunity around the school, and ^ I 

- Degree of urban iza^t ion. " ^ 

Students from low-income families and racial minorities were over- . 
sampled so that sample sizes for these groups ip the national s^ple .would 
be adequate for separate arialyses. ^ ,* » / 

. " Tl^ study designs-excluded schools fot the physically or mentally 
handicapped, schools for legally confined students, 'or schools that did 
not enroll students of thfir own (such as area vocational schooj^s whogte 
studentS|^ were enrolled ij> other schools included in the sample). Also , 
'excluded were certain classes of st^idents, such' as eaiqly graduates and '. 



r 



adult education students. 



.III. IiNSTRUMEICTS AND TEST BATTERY 



A. ' Base-Year Instrumentation 

Each student in the sample Vas asked -tc complete a*Stu4enO Question 

• . ^ V 

naire containing^ 104 questions distributed bver 11 major siections. The 

base-year Stucier\t Questionnaire dealt with . factor's irelated'to the' student 's 

• . : ■ ^ 

personal^f amily backgroand, educational and work experiences, plans, 

Aspirations, attitudes, . and* opinions. In addition, each student was. 

. « V ' - "N * 

asked to 'complete a 69-minute Test" Book designed by the Educational ^ 

. Testing Servicje to measure both verbal and nonverbal ability. The Test 

Boole consisted of €lx tests, incl<jdin*g vocabulary, picture number, • 

reading, letter groups, mat;hematics, and m'osaic comparisons. 
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' BasB-year Hata were also obtained .from a Student's School Record 
•Information Form (SRIF). Items on the SRIF pertained to the student's" . 
high .school curriculum; grade-point average, and credit. bours 4n major^ 
Cwrses. ' If apt)licable, positions in ability groupings,' reraedial- 
J^nsUfu^ction records, involvement in certain federally supported programs', 
and /secures on standardized te^ts were also included. 

FinalJ.y, information from a School Questionnaire ^nd one or two 
Counselor ,Questionnaires^were also collected for each participating high 
school. - * . 

B. ^ First ^llow-Up Instrumentfat ion , . • , 

Two fomrs (A and B) of a First Follow-Up Questionnaire were de- 

velppeci and designed for self-administration by the student. Fprm A of 

the Firfet J'ollow-Up Questionnaire was mailed to eaqli sample member who 

responded to the- ba'se-year Student Questionnaire, Seniors from, the high 

school, class of 1972 who were unable to participate in the base-year 

/Survey (usually because of time and scheduling considerations) wer-«-^ 

mailed Form B^of tliie questionnaire. Qufr^fionS one through 85 were 

' .* _ * ' • 

identical on both .questiionnaire forms. These qtleStioris dealt with 

4' - * * 

information ^concerning^he respondent's activity state (e.g., education, 

\^rky Btc.) in October 1972 and October 1973, his or her socioeconomic 

status, work and educational experienced since leaving- hij^ school, and 

future educational and career plans, aspirations, and ^pectations. , 

Form B of the First Foll6w-Up Questionnaire also cpntained an additional 

14^ questions from the base-year Student Questionn^re. Most of the .\ 

\tems of the base-year and first ioliow-up instruments are of "the -forced- 

choice^type" Open-ended, or f ree-response,_j^^s wete .limited to quee- 

tions involving dates, income, number of hours (ST^^weeks ^rked and the 



■ IV. DATA COLLECTION PROCEDURES , * ' . 

A. -Base-TYe'ar Datra Collection . . • . , 

~- ^- . ' ' S — * . - . ^ ■ ' , 

the 1,31^ school^, 1,070 were able tp participate in the^base- 

year survey, resulting in 16,683 completed Student ,Queg^tionnaireSf 

The bvik of the student data was collected ±xt April, May, and June^of 

1972, through gr'oup administration in each schoq(l- Survey administrators 

also caapleted School Record Infcrmation Forms (S11IF%) for j^c^ partici- 

patlng student, plus approximately 1,000 additional student's during the, 

summer of 1972. It was intended that these additional students, who did - 

not com^lV^e the Student, Questionnaire' in spring 1972, would be sent 

Form B of the First Follow-Up Questionnaire so ^s 'to be able to obtain 

some base-year student information from the resjfonJents. 

B« First Follc5w-Up Data Colle ction 

* * — — i{ 

*Jhe first step in data^oliection for first follow-up inlrolved an 
^extensive tracing opera&ion_to update -name and address -files. The 
Research Tria*ngle Institute received from the previous contractor a name, 
and »adiress fiX,e from the base-year survey, A total of 18^672 of 
these' individuals from 1,0A3 schools were used for the first follow-up 
survey. In addition, 257 resurvey school^ with 4,450 individuals ^ 
were added to the base-,year lists to give a first follow-vip survey 
sample of 1 ,'300^ Schools and 23,122 indij/iduals, A newsletter was de- 
veloped and mailed* in July 1973 no*t onrly to encpQrage participation but 
also to use as a vehicle for updating names and addresses. When mail 
was returned by the postal Service as-otideliverable, telephqne tracing 
•procedures were used to obtain current addresses where possible. 



/ *The list of ^esfirvey students was provided by, the S, Office of 
tkiucation, . ' ^ * 



' Prior to tlte mailout pf the first .Follow-Up 'Questionnaires, 102 . 
individuals Wfere deleted frpm the mailing list for operational reasons * 
{e^g., duplicate names, ^bad addresses, deceased). ^ ; 

Questionnaires \Jere mailed to' the -last kjiown* addresses of the 
resulting sample qf 23,020 on 23-24 October 1973, This was follpwed by 
ar^lai>ned sequence. of reminder '^postcafds, ^dditioiral questi6nnaire 
mailings, #and reminder mailgrams to nonreapondents, , Active mail return 
*ef fcrtsycontinued thr'ough DecCTibfer •1973 ; hy early January 1974, the 
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questionftaire ret^irn rate by mail was 60^9 perce 

The names «nd addresses of tho^e sample members who failed- to mail 
back theii:^ questionnaires were then turned over to the Bureau of the 
Census ^r personal interview in accordance tdth a Bureau arrangement 
with the U» S. Office of Education. This personal interview phase of 
first follow-up dat^ collection coRtinUed until April 7, 1974, atwhich 
time the overall response rate had been increased to 92.8 percent, . 
21,350 respondents out of 23,020. ' > • ' 

V. FIRST FOLLOW-UP DATA PREPARATION 
A, gre^Iachine Data Processing ^ | 

^ ' Questionnaires returned** by mail , either from individual sample 
members or from the Bureau of the ^Censtis ititer^iewers, were manually 
edited to ascertain if each questionnaire' contained a minimum set of 
"critical" data. Questionnaires which passed .this manual edit stage 
were*th^n transnitted to thft direct data entry section to be transformed 
into machine-readable format. Those questionnaires ^hat failed the 
manual .edit procedures were routed to the 4;^lephone 'follow-up section, 
- where an attempt was made to- contact the respondent and recover missing 
information or otherwise resolve problems uncovered' in manual edit. 
After resolution, these questionnaires were also transmitted to the 
direct data entry section for encoding. In general, the^encoding ^ 
process- involved transcribing the questionnaire responses onto magnetic 



t^e. . However, there were certain categ/ories of /^^stionnaire ittos ' 
that requ|r'eci^ some manual edi'ting before kransctiptioni i.e. , ^estions# • 
dealing \>itl| occupations, flel/ds of study, s'chool naines, and thos^ i^&ns 
that h:ad"an alternative labeled "other" and allowed a write-in response. 
ManuaX .reoodirig of alphabetic information was p0rfolnuad, prior tdVrapBcrip- 
tion into machine-r eacjatle form'. Questions conc^rnitig^ the respondent's , 
(or his parentsp') *9ccupation were recoded into the c'orr.espond^jg three- 
xiigit codes sp^dcif ied . in the Census Ind^ of. Ii;idustries and Occupations. 
Postsecondary ^(ihDols identified by resp'ondents were recoded i^to six- 
digit 0, S. Office of Education Vendor or FICE* codes; Using a master 
index provided by QE (these ^godes^have been witheld, however, in ordet 
to prt)tect the confidentiality of respondents). Fields of study reported^ 
^by 'respondents wl;io attended sctiool^ duj^ng the post-high' school perio'd 
were recoded into HEGIS ^c^egorieslTusing both the foiur-digit academic 
subdivisions {provided by the^HEGIS taxonomy and the six-digit HEGIS - 
technological and occi/pational schet&es being developed* Finally, responses 
indicating a ty-pe of/license, certificate, oii^diploma received were 
s;unilarly converted to numerical codas. , i 

There were 18 questions 'in the First Follow-Up Que3tionnaire which ^ 
^had closed, of fixed-response ialternatives and which included an "other"* * 
option with room for the respondent to write in his-pikrticular answer. 
If the "other"' category were chbsen by th^ respondent, that response was 
reclassified whenever possible into the closed-choice Options provided. 
V/here reclassification was not an obvious or logical possibility, a code 
indicating' use of the '*other" option was inserted, but the alphabetic ^ 
description was not includeci on the data file. 'A 

Ther^ were four questions asking for alphabetip information in the 
First Follow-Up. Question|iair% for which no numeri£ coding was done. 
^That is, the ^vrritten rei)lies of the re^ondents were coded as they . 
appeared oiT^he questionnaire.^ The user should be aware of these 
alphatatic fields and kxse the appropriate computer. techniques td process 

them. . ; ^ 

Beyond recodin^of alphabetic material, a uniform set of standard 

numeric "error" codes was defined an(^t>plie^, ^^^oss the file to indicate 
certain conmon classes of erroneous or missing data. These class^ of 
^response are listed below: * • * 
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- Partial Response / This code applied only-to those V^stions ^ the - 
First Follow-Up Instrument that eTrfpioy t^he two-column response^ ^ 
format. .Each of these questions 'consists df ^ set of subitems and 
requires The respondent to indic^ate whether each subitem* applies 

. pr do.es not apply/ If the^ respondents answered at least pne, but ^ 
not all,' of I the subitems "»r a qirestion of this kind,' then those 
- , unansT^efed '?iubitems were coded to indicate a partial response to 
tjie question. ^ ^ * - ' ^ " 

- Don't Know , (self-explanatory) . " 

- Out-'of -Range Response . This code is ^sed when the response or 
transcription exceeds some specified acceptable r^nge. It is 
describecf more fully in the following section. — ^ • 

- Multiply Response . This is usfed if the respondent gave several 
. -answers] to a question when thfe direct ionS called for only^ one. 

• '> ' • 

- ilefusal. (self-explanatory) ' ^ . ' 

^ I , ' , 

• Blank, or Inappropriate Nonresponsfe . This is used, when the respondent 
should have_^ answered the, question and dfd not^ 

. Legitimate Nonresponse , Thfs 'is used when the respondent shduld 

not have answerecT the question and did not. As welP, if the respondent 
did^not answer an entire Instrument, then all. fields representing 
that instrument werfe. coded 99. ) I .- 

The steps desqribed above were the only manual editing procedures 
applied to the data. T^ie manual editing was limited to insure, as^imich 
as possible, ^ that the data file would accurately reflect the actual 
response oh the questionnaire. /Jt' was felt that any^futther editing 
should be ,done by machine to insure uniformity of application and — - 
rcplicability . . ' • • \^ *^ 

, / VI. MACl^INE EDITING PROCEDURES 

Some of the major steps/taken toward preparing the NLS data tapes - 
for public release included "hard copy" (source docutnent) spot checks, 
and feachine editing^ which involved recoding ^l*l«.nterptetable responses 
and sonfe logical recording of other responses. As a result, the- final 
data file contains only .(a) valid response -todes, (b) codes describiq| 
type of erroneous or missing data, .and (c) "logically recoded values," 
including, in £ach case, an indicator for thf reasons for such a r^code. 



Three sequential 'machine editing programs were employed 'and are 
desct^tbed below- (These programs were not applied to the NLS Ua^e-year 
data," since-th^se 4ata were already subjected to editing procedures 



employed by me previous contractor. The base-year data were siinply » 
.r^ornTatted* and in Some cases recoded,- to achieve consistency with the 




firsjt follow-up data on the file.) 

A. ' Jlange Checks 

* The first program in the^dit ingTsequence dealt with out-of-range 
data. This program* Checked the responses to each fixed-response item 

•against a specif i|d range of ' acceptable values, "flagged" an^ v^lue^that. 
fell outside of this range^ and recoded each flagged datxm to indicate 
the occurrence of* an but-of-range resp^orise. It. should be noted tWt s 
this recoding procedure Uas applied to fixed-choice items "only. Acceptabl 
ranges we4fe also specified for 79 free-response items c^%ing for tiumeric^ 
data, and the frequency, of responses outside these ranges has been 

.tabulated. In general, these responses were logically possible but were 
considered highly improbable. It is recommend ed^that the user make hi* . 
own decisions as to what .is acceptable data for these items, using as ^ 
guidelines the ranges set* forth. It was felt that in some cases the^ 
outlying responses may provide a"dditional information and that it wa? 
best to leave- th^ in the file so as to provide as faithful a transcrip- 
tioil^of the original records as ^possible. , ' \^ 

B. Consistency Checks^c - » • . , 

— ^ 3 ' . . . ' . ^ * 

Phase two of the machine editing sequence was concerned xd.th icheck- 

ing the consistency of an individual's responses over the entire question- 
naire: A set oJ 94 internal checks, d*r H^esponse comparisons, were ^ 
selected on an a pricfri basis for this ijurpose. ^The consistency .prograi^ ; 
read the responses comprising each individual ' s tre(|^rd and flagged those 
consistency checks that were failed./- As will bOsLti escribed, in a later 
section,' an index (^s subsequently cpmpiited for^ach record based on the 
number of Censistency checks failed by the individual, '^hi^ndex 



reflects the internal .ccnasistency of* a record and, tfier^ore*, 'proi'id^'' 
the user with .a ^ough indication of t'he^ cfuality c?f ^ach respondent's 
d^ta. • ' ^ .. ' 



-% ^ < ^ ^ . • ' i ' 
C Routing Checks ^- • ' ./ ^ ''^^ . ^ '^•^ - , ' / 

The .first; f ollow--up' insttqiienbtrantla^Tts 33u^outing .questions', ' ^ 

' ' ' "vV:.>.. .4 J: v^? ^ ■ * • . , ' 

routing, question is xm^that eith6''t^i*ipli'citly or explicitl^direots a 
r.esi?pndent around t)ther. (questions ia the* instrument^ The aim of the. 
touting questions is to quietly move respondents around <[uestionha^ire 
sections that do not apply to them. Ift order to det€!rraxne if the i^^e- * 

A- e . ' ^ . * * " ^ 

spondents correctly followed the routing ijattern^'i a routing |check *\ 
program was developed and implemented, 'This program read each record' * r 
and flagged respronses to all routing ijuestions that Vere inconsistent^ • 
with the subsequent patt,ern*of response; rfecoding these quest io^^ to I 
indicate the* type of , inconsistency det^ted. Three types of i-ncon- 
•'sistency have been identified ' and recoi'^^^^in tiie data^file: j 
f 1. Wbetf ttje response .to a routing i^em indlwetes tKat the ques- 



tions associatSJT^ith that 'item (i»e., cx)ntained within tlje 

^ routing pattern) should be skipped but ar^ not- ^ 

2. When the' response to tTie routing item indicates tl)at the ^ 

questions'associated with that item shduld ba answered btit are^ 
• » • / >* ^ 

not. 

' ' *"* 
^ 3. (Actually- a combi^latian of 1. and 2. above.) ^When the response 

to. the routing itm indicates t^at certain questions should be^ 
" . ^ skipped but are not (type on^Y ^h^fS^ other qdestions shauld- be 
# ; answered but are not (type IiwoX. \ 

One other function of the' routing check program was to dif f erenti^ite 
between legitimate nonresponse (coded 99) and^illegifrimate nonresponse 
(coded 98). Legitimate nonresponse. wa^ defined' as ncinrespoitse fb ques- 
tions hat ' the respondent has bfe^n routed arbund. If a respbndegit was \ 
routed into* a ,blocJ^, then acity Jionresponse to those items^was considered , 
illegitimate;. Al^o, if the toutig pattafrT was answered' in ^a iaanner ^ 
inconsistent* witT) the* routing- instructions, th^ the nonresponse wsts 
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• cotisldered illegitdmate for th^se items. Th|»4>nly time that nonresponse. 
would be coded 99 (legitimate. nonrel|[onse) was when thje respondent, had J ' 
an unf la^ge^d responsfe-rto the ipouting question that routed him around a 

" group of^quest ions r ^'he-^ef f ect^ of this coding for nonresponse was to 
overestim^e the ftlegitimafce nonresg^se. That is, 'irt cases wh^re a. / ^ - 
respondent's pattern, of. response does not give a clear indication, which . 
questions he should an«weY, then the nonresponse ia thaf p'^ttfern was 
.coded 98.- In some of/ the jior^ comply routing patterns, nonresponse *^ » 
\{ill4>e coded' illegitimate (98) for a l^rgd section of Ipems due to one " ' / 
inAivsistency. ^This implies that the useplshould be qi/lte cJ^^u^|^^ ^ 
interpreting the 98 and 99* codes.. For some an^lyses^^^thgf^nr^^^ 
t*© redefine, legit imat^e and •iljLegij.tioate skips. ^ w i^im- # ^ 

.VII. CONTENTS AND ORGANIZATIOJJ OF THE PATA FILE ^ ' ^ 

* . ' * ~ ' ^ w , ■ ' , 

.*There are 22 ,398 jfrecords in the NIS meirged. base-ye^r/f irst f^loxo- 

up data file — oqe record for each^ respondent. . Each recprd consists of . 

I 921 variables. The complete release tape .vari^le and response' list^ 

are presented in the NLS Data User's Manifal (L^vin^ohn, Riccobono^ and 

>Ioore, X975). These computer geneifated lists proyide a detailed account 

of 'the data ,otgani2atioii^ithijl\„a: retol:<? ' ' Tho, varidj^^ contains ^ ; ' 

. the name and' description of each'variable, th^r.f ield or cS^ractet positions \ , 

* Containing each variable,*' and a response -^st reference code f% each -, . 
variable. The response list catalogs the valid 'response ^cedes for gxv|p 
types of variables in the variabl^ list . * j ' . ' 

It 'should be mentioned "that a number of vaifi^J|i^s, or items", from . 

'ea'eh of , the JILS data collection instr^ents were tibt included in the ^ . 

' release file. • Item^Were* excluded iir?iinaTil|;< fof reasons of confidenr ^ ' 

tiality, but ther^ were also a, number of itefas which were deleted or 

modified because of excessive prior editing or .poor response. 

' r .The data contained'in each record in the file ar el' in general , 

.groap'fed J>ir type *and;' ordered as follows:' indentif ication ^odes,\ da4a 
** * ' ' ' ' ^ 

■'indicators^ .tesn bajctery data, student's School Record Infonhatiorv' Form 

(SRIF) J^ata, base-ye^r Student Qu-festionnadre data, ?ir?t 'Follow-Ulp , 



Questionnaire data^^^qtrSTLty indices, sampling weightf, schoot-fvariable^ 
*'datA/ajid analytic indices. Each of these data grofips is briefly 
discussed in thieu following paragraphs. ^ - ' . ' 

:r— g ^ _ _ . ^ ^ ^ . . . ^ . . 

' • ''Two different identification' codes appear at th6 beginning of each 
. record 1^'' a random ^ivQjdxgit student ID'tlumber, and' a ^six-digit school 
Cbde.tha^ay be used to group together students *from the same t^igh 

^^Severdl indicators have be^ included in the release fi]*e to assist 
users in processing* the. dat^ records. These indicators signif^y the ^ 
..^presence or^absence o^ First .Follow-Up Questionnaire data, Te^ BdoV . 
. data, SRIF data, and base-year Student Questionnaire data.. <r. : :\. 

, As noted earlier, each student, in the base-yeat survey, was asked to 
.^complete battery of six tests, including: vocabulary, picture-number 
<a ' two-part te3t), reading], letter groups, mathematics^ and mosaic 
caparisons (three parts)-" Total te.st scores ^we?:e computed for eath of 

* thes§ tests. In addition, subtest scores were cpmpute^l^parately for 
each, pa^t of the picture-number and mosaic cSbparisons. tests- Thus, the 
t^st battery dat/ for^ each tecord consist of 11 subtotal and totalj 

[scores in all. For each. test or subtest, formula scores were computed 

' - ' ' ' ^ \ 

from, tfie item response ast ' • / . 

i^S — R-W/(C-1) . - 

where 

FS = 0 Formula 5care ' • 
R = Number right • f ^ ' ^ 

W = Number wrong 
.C- Number of response to item. 
^'^as.e test scores were also standardized^ across The sample, and the 
.^-^t^Qdirdized scotes {y^k mean of 50 and standard deviation of 10) 

• stored on the file. . 
" * Data from* the SRIF and base-year Student Questionnaire were coded 

and edited acoording to the previous contractor's specifications. SRIF 
data in the flle>inc^de student average,. grading system used by the 

' student's high school, lowest^ and highest grad"es possible in. the school 

■y 

' griding system, ?nd student^s percentile rank in'oli^s. 
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Base-j^ear and First Folliiw-lTp Student' Questionnaire iata were, Jwith 

few exception^, coded in a format that, was a one-to-one match with the 

*• . I 
instruments. 'T^hese* data constitute t!la^ 'bulk of the data file. 

" Unadjusted and adjusted student weights were oalctila£e<5 and stored 
.on the NLS data file.. Unadjusted stodent weights were cfalculated^as the 
inverse of tjhe sample iWlusionJ probability for all students sampled. 
A w^rtghting claps method was employed to calculate six different sfets of 
' student weight^^ad justed for nonresponse; each' weight se-t is^appropriate 
/for analyses involving a particular set of data. ,The methodology and 
usfi of the v^ious weights are described in detail in the .NLS Data 
User*s Manual'' (L?vinsohn/ Riccobono, and Mdore, 1975). \ 

Only three^variables from the School Questionnaire were selected 
' , f orjinclusion in each student's data record. They^are: yegion, t)^e of J . 
* community, and senior class ^enrollment . 

Two kinds of composite Indices — quality and analytic — are included 
in the NtS data file. Fo.ur quality indices have be^n developed in order 
to qjuantify the amount and quality of first Follow--Up* Questionnaire *data 
present in each record. •They are: XjQJ a consistency^nd^, representing 
the- percentage of the 94 consistency dhecks failed by an individual; 
(2) an out-of-range index, representing the percentage df- out-of -range 
responses for an individual's record; (3) a routing index, representing 
the percentage oi routing^ questions ambiguously an$wered by an individual,- 
i.e., routing questions that were unanswered or answered in a manner- 
inconsistent w\th other response^; and (4) a completeness index (for 
, each major sectioa of the questionnaire), representing the precentage of * 
^^i^ems,x^rLth valid responses, i . e. , 'responses that are^not coded as, errors 
or missing data. ' * 

Users should b^ cautioned that the. utility of these fluality indices 
is in the determination of the credibility o^f individual records. ^S^y 

are of no use in making judgements about data*^ch as \iteifl responses when 
• , » • *' • 

'^considering more than one respondent, sitic^ the real tesjt of item response 

, quality i3 the over-subjects' distribution. These indices should not be 

considered for discarding 9^b|^cts unless one^s cpncern is with the 

4 '-^ entire instrument.-, • ^ • * ,* 



•i 



Also included in'-the-data file are two composite indices, ability . 
an'd socioecortomic atatus (SES) , useful for marry analyses. In cotitrast 
.to the quality indices whi^h focus on a single individual arijd >[uant if y 
the amount and quality of information present in an indavidyal^s record. 



the analytic indices are derived from glottal ^considerations of the 



Lytxc ^ — * — , ^^^^^^ ^ ^^w^^w^ , 

entire and. serve primarily as classi-f^fatory, variables by which 



one can%group individuals. Thus% each, r^ivi'dual was assigned a code of 
1, 2^ or 3 depending on' \^ether his ability or SES composite score was I 
in* the lower, middle two, or upper qiiartile range- 

^ Both of the analytic indices involved several components an<i required 

** ♦ "* 

several steps in their derjLvation, It should .be added, however, that since 
other procedures and components may'n>e ^ployed in deriving^ji^ch indices, % 



the ability and SES raw scores wei;e also; ^ncluded in the file, and users 

are encouraged to decide for themselves^ whether- the derived indj^es are 

appropriafe for their paifticular needs or purposes. 

« Table 1 presents a br^eakdbwn of thftvkiitfls and amounts of data 

available on the NLS r^ease file for important subpopulations. 

' , ' . " I' ^ ' 

* ' j ' ^ J , * . . • 

• VIII. HOW TO OBTAIN "DATA TAPE ' 

* A set of ^NLS tapes containing.-marged base-year and first fpllow-up 
(first level of edit) data is noW" av^l^ble for purcha'se as two- 2,400 
foot reels of 1600 BPI^densjLty at ^' cost of $156.00 per set. This 
purchase price also includes a User s Manual • Information pn how to 
obtain a set of these data tapes and file docum'enta^^n may be obtained 
from Robert A, Heinrtze, National Ceate^ for Education Statistics, 
Room 3069, Fo'b-6, 400 Maryland Aveniie, 8W, Washington, 'b. c; ^20202- * 




able 1. Data: av^labllity for 'subptTpulatlons by Instrument 
(N - 22,398) ^ ' 



INSTRUMENT 



SUBPOPULAnOH 



Bas6-^ear (B-Y) 
Student Quest*. 



First Follow-up 
(FFU) Quest. 



SRIF 



Test Battery* 
* \ TB 



B-Y, FFU 



FPU, SRIF 



B-Y 
FFU, fa 



Sex: 



naie 




10,463 


10,233 


7 ,894 


7,665 


7,65fi^ 


7,307 


7 ,301 


Female 


8,397 


^ •' 10,841 


10,376 


7,953 


7,967 


7,957 


7,553 


7 ,544 


cnclasfllxlable 


13 


46 


42 


12 ; 


3 


3 


3 


3 


Race: 


\ 
















White* 




. 15,272 


\ 14,721 


12.111 


11,949 


11,938 


11,442 


11,433 


R1 




7 71Q 


' 7 7 


if 90b 


i,y-zu 


1 Q 1 7 

1 ,yi / 


1 , /bb 


1 7C7 

X • /bJ 


Other 


1, bU J 


1 Q'>Q 


T QIC 
l,OlD 


l,5Uo 


1 ICQ 

l,4bo 


l,'ib3 


1, J/9 


1 • J7b 


unciassi£i.aDie 


J 


* 1,j1U 


1,522 


334 


298 


298 


0 ^ C 

27b 


«. 276 


H.S. Program: 




S, • 










<?» 




Acqdemlc 


6,811' 


8,511 


8<312 


4i5^1 


6,ieft 


6,465 


6,2Q/» 


6,201 


, General 


5,665 


. 7,492 


7,253 


•573^3 


5,235 


5,224 


4,955 


4,9^i4 


. Vdc/Tcch ^ 


4,201 


5,148 


5,063 


3,956 


3,927 


3,924 


, 3,^99,' 




Unc lass 1 flab Xe 


6 


,199 ^ 


23 




5 


5 


5 


' 5 


. Rej^ion: 










' 




* ^ 


* 


North . 


3,618 ' 


\ 4,483 


4/3I6 


3,321 


3,420 • 


3,420 


J' 3,323 


3 ',323 


Central S 


4,568 


*5,541 ' 


5,468- 


4,122 


4,292 


4,288 


3,875 


3,873 


South 


5,513 


-7,691** , 


'7,242 


5,382 . 


5,186 


•5,178 


5,057 ' 


5,049 


West 


* 2,984 


3,635« 


3,625* • 


^ 2,834 


2,737 * 


^,732 


2,608 


2,603 


Unclasslflable 






— ^% 












Ability: 










» 








Low 


4^788 


. '4,392 ' 


4,783 


4,798 


' 4,382* 


4,374 


4,382 


4,374 


Medluta 


. 7,000 


6,600 . 


6,997 


7,008 


^,592 


6,585 • 


, 6,592 


t,585 


High 


4,052 


3,890 


j 4,052 . 


4,053 i 


3,889 


1,889 


• * 3,889 


3.889 


Unclasslflable 


843 


• 6,468 


' 4,819 




772 


770 






SES:: 
Low 


^;076 


y 

6,423 


6^227 ^ 


-4,775 


4,735 


4,729 


i' 

4,458 


4,453 


Medium 


7,816 


9,635 


9,393 


7,448 


7,3*20 


7,310 


6,971 


6,962 


High ^ 


3,667 


' . 4,686 


4,499 


3,5^5 


3,506 


. 3,50Sj^. 


3,370 ' 


3,369 


Unclasslflable 




606 


532 


111 


74 


74 


64 


64 


Total 


16,683 


. 21,350 


20^651 


15,859 


15,635 


15,61$ 


14,863 


14,8r8 
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